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Introduction 


mi 


f — In  this  paper  w»  examined,  the  set  of  equilibrium  payoffs  in  a 

/  r 

repeated  gcme  when  there  are  bounds  on  the  complexity  of  the  strategies 
that  players  may  select. 

-  The  interest  in  putting  such  bounds  comes  from  the  limited  computa¬ 
tional  ability^  of  humans  and  devices  used  by  humans  ,  (see  Simon  [1957, 
1972]).  'For  example  most  of  the  strategies  in  a  repeated  game  cannot  be 
implemented  by  any  computer. 


"It  is  important  to  distinguish  between  the  complexity  of  a  strategy 
and  the  complexity  of  the  process  of  selecting  a  strategy.  -We  will' not 
deal  with  the  selection  process  directly.  We~wAi£  assume  that  the  limi¬ 
tation  of  a  player  is  such  that  he  can  consider  all  the  strategies  below 
a  certain  level  of  complexity.  A  possible  interpretation  is  that  the 
players’  abilities  are  unbounded  but  they  use  bounded  devices  to  imple¬ 
ment  their  strategies  (secretaries,  or  computers).  —  '■>  /  ■ 


*  This  work  was  partially  supported  by  Office  of  Naval  Research  Grant 
N000-14-86-K-0216  at  the  Institute  for  Mathematical  Studies  in  the 
Social  Sciences,  Stanford  University,  Stanford,  California.  This  work 
is  based  on  my  M.S.  thesis.  My  advisor,  Professor  Abraham  Neyman 
introduced  me  to  the  topic  and  guided  me.  I  owe  him  many  thanks. 


We  use  the  notion  of  a  finite  automaton  to  define  a  complexity 


measure  on  the  strategies.  A  finite  automaton  is  a  machine  that 
contains  a  finite  number  of  states.  One  of  these  states  is  the  initial 
state.  The  machine  has  an  action  function  and  a  transition  function. 

The  action  function  determines  the  one-shot  game  strategy  that  is  played 
at  each  state.  The  transition  function  specifies  the  next  state  as  a 
function  of  the  current  state  and  the  current  action  of  the  other 

- - i - — 

players. J  An  automaton  induces  a  strategy  as  follows:  The  state  at  the 
first  stage  is  the  initial  state.  The  state  and  the  actions  of  the 
other  players  at  stage  t  determine  the  state  at  stage  t  +  1  (by  the 
transition  function).  The  state  determines  the  one-shot  game  strategy 
(by  the  action  function).  The  size  of  an  automaton  is  the  number  of 
states  it  has.  The  complexity  of  a  strategy  is  defined  as  the  size  of 
the  minimal  automaton  that  can  implement  it.  The  main  result  Is  that  in 
a  zero-sum  game,  when  the  size  of  the  automata  of  both  players  go 
together  to  infinity  the  sequence  of  values  converges  to  the  value  of 
the  one-shot  game.  This  is  true  even  if  the  size  of  the  automata  of  one 
player  is  a  polynomial  of  the  size  of  the  automata  of  the  other  player. 
This  means  that  a  player  can  gain  something  from  being  able  to  use 
strategies  which  are  more  complicated  than  the  strategies  of  his 
opponent,  only  if  his  strategies  are  much  more  complicated.  The  result 
for  the  zero-sum  games  gives  an  estimation  for  the  general  case. 

The  idea  of  using  a  finite  automaton  in  order  to  distinguish  between 
simple  and  complicated  strategies  was  proposed  by  Aumann  [1981].  In  two 
recent  works  Neyman  [1985]  and  Rubinstein  [ 1 985 ]  used  the  notion  of  a 
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finite  automaton  to  explore  the  issue  of  strategic  complexity  in 
repeated  games.  Neyman  shows  that  in  the  finitely  repeated  prisoners 
dilemma,  putting  a  bound  on  the  complexity  of  the  strategies  can  induce 
cooperation.  Rubinstein  in  the  infinite  version,  does  not  restrict  the 
players  to  a  certain  level  of  complexity,  instead  players  seek  to  mini¬ 
mize  the  complexity  of  their  strategies  provided  they  do  not  decrease 
their  payoff.  This  eliminates  many  of  the  equilibrium  payoffs. 

The  model  in  the  present  study  was  proposed  to  me  by  Neyman. 

2.  The  Model 

12  i 

Let  G  be  a  zero-sum  game  G  =  (3  ,S  ,r);  where  S  is  a  finite 

1  2 

set  of  actions  for  players  i  and  r:  S  *  S  *  R  is  the  payoff  func¬ 
tion  of  player  I.  Let  V(G)  denote  the  value  of  the  game  and  let 

1  2 

maxmin  (G)  and  mlnmax  (G)  denote  max  -min  „  r(s  ,s  )  and 

s  es  s  GS^ 

/  1  2- 

min  „max  ,  .rls  ,s  )  respectively, 

x  es  s'es 

An  automaton  A*  for  player  i  is  a  four- tuple  <M1 ,q* , f 1 ,gi> 
where  M*  is  a  set,  q  e  M  f  * :  -*•  S*  and 

g:  M*  *  +  M*  (j  *  i).  M*  is  the  set  of  states  of  the  automaton, 

q1  is  the  initial  state,  fi(qi)  is  the  action  the  player  chooses  when 

the  automaton  is  at  state  q*  and  g*  is  the  transition  function,  if 

the  automaton  is  at  state  q*  and  the  other  player  chooses  s^  the 

next  state  is  Si(qi*s'^).  An  automaton  is  finite,  if  the  sets  of  states 

is  finite.  We  will  consider  only  finite  automata.  The  size  of  an 
automaton  is  the  number  of  the  states.  An  automaton  of  player  i 
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induces  a  (pure)  strategy  in  the  repeated  game  as  fellows:  The  action 
i,  i,  i 

at  stage  t  is  f  (q  J  where  q  is  the  state  of  the  automaton  at 

stage  t.  The  sequence  of  states  is  determined  inductively  by 

q*  =  q1  q  .  =  gi(q^,s^)  where  Is  the  action  of  the  other  player 

X  X+ I  L  u  u 

at  stage  t.  For  example  consider  the  game: 

L  R 

T  1  -1 

B  -1  1 

The  strategy  of  player  I  which  begins  with  T,  continues  with  it  as 
long  as  player  I  chooses  L  and  plays  B  forever  if  player  II 
plays  R,  is  induced  by  the  automaton  A  =  <M,q,f,g>;  where 

M  =  {l ,2} ;  q  =  1;  f(l)  =  T  f(2)  =  B 

g(1,L)  =  1  g(l,R)  =  2  g(2,L)  =  g(2,R)  =  2. 

1  2 

Given  that  the  automata  of  the  players  are  A  and  A  the  correspond¬ 
ing  strategies  in  the  repeated  game  determine  a  sequence  of  actions  and 

payoffs.  Denote  by  Rt(A'fA  ;  the  payoff  at  stage  t.  The  payoff  when 

1  2 

player  I  chooses  A  and  player  II  chooses  A  is  defined  to  be  the 

2/ 

limit  of  the  means—'  : 

(1)  R(A1,A2)  =  lim  £  l  R.(A1fA2) 

T+®  t=1 

We  are  interested  in  the  value  of  the  game  where  each  player  is 
restricted  to  strategies  that  can  be  implemented  by  an  automaton  of  a 


given  size 
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Formally,  defines 


=  {A1  lA1  is  an  automaton  of  size  n  for  player  i} 

r  2,2  , 

=  |A  is  an  automaton  of  size  m  for  player  II) 


(An,Am,R)‘ 


Thus  Gn  m  is  the  game  induced  by  restricting  player  I  and  player  II  to 

strategies  that  can  be  implemented  by  automata  of  size  n  and  m 

respectively.  Note  that  Gn  m  is  also  a  zero-sum  game. 

I  assume  without  loss  of  generality  that  the  set  of  states  of  an 

automaton  of  size  n  is  {l,...,n}.  With  this  identification  A 1  and 

n 

2  IP 

Am  are  finite.  Let  k  =  |S  |  and  h  =  |S‘|.  Then 

,  1  i  n  n*h  .2.  m  mxk 

|A  |  =  n  •  k  •  n  and  |A  |  =  m  •  h  •  m 


3.  The  Asymptotic  Behavior  of  V(G  ) 

The  main  result  in  this  section  is  that  if  p(n)  is  a  polynomial 

lim  V(G  .  J  =  V(G). 
n-*»  n,p(n) 

First  note  that  player  I  can  get  at  least  the  maxmin  (G)  by 
playing  constantly  the  maxmin  action.  An  automaton  of  size  one  can 
implement  this  strategy  and  of  course  any  larger  automaton  can  do  it. 
Similarly  player  II  can  get  mlnmax  (G).  Thus  the  following  inequal¬ 
ities  hold 


maxmin  (G)  <  V(G  )  <  minmax  (G). 
n,m  — 
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The  first  result  states  that  for  any  number  n  there  exists  a  larger 

number  m  such  that  V(g  1  =  maxmin  G. 

n,m 

First  we  need  a  definition  and  a  lemma. 

Definition:  (a)  A*  is  a  partial  automaton  if  the  transition 

function  g  is  a  defined  on  a  subset  of  M*  *  S'*. 


(b)  For  two  partial  automata  A  and  A',  A'  is  an 
extension  of  A  if  its  transition  function  is  an  extension  of  the 
transition  function  of  A.  If  A'  is  an  automaton  it  will  be  called  a 
completion  of  A. 

1  1 

Lemma  (1.1):  For  every  A  €  An  there  exists  a  partial  automaton, 

2  -2  2 
A  of  size  n  such  that  for  any  completion  A  of  A 

(2)  r(a  ,  A2 )  <  maxmin  G 


1  1 
H(a  )  is  the  best  reaction  for  player  II  to  the  action  s  of  player 

I.  Denote  the  different  states  of  A^  by  {l,...n}  and  assume  that 

2 

the  initial  state  is  1.  Define  a  partial  automaton  A  as  follows: 


M2  =  {l,...,n} 


q2  =  1  f2(i)  =  h(f 1 (i ) )  1  <  i  <  n 
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g  is  a  partial  function  which  satisfies 


g2(i,f1(i))  =  g1(i,f2(i)) 


i  <  i  <  n 


It  is  easy  to  see  that  g  can  be  defined  in  such  a  way  and  that  every 
completion  of  satisfies  (2). 


2  1  3 

Theorem  Is  If  L(n)  >  n  |A  I  =  n 

* - -  —  1  n 1 


n*h 

n  there  exists  an  auto- 


2  2  1112 
maton  A  £  A,  ,  \  such  that  for  every  A  £  A  R(A  fA  )  <  maxmin  (G). 
L(nj  n  ~ 

2  2 

We  will  construct  A  .  Roughly  A  operates  a3  follows:  it 
identifies  the  automaton  A  and  then  uses  a  subautomaton  from  the  type 
described  in  lemma  1.1. 

1  1 

Proof:  Let  A0  1  .<  A  <  |A  _  I  be  an  automata  of  player  I.  With 

1  2  2 

every  A^  associate  r.  identical  partial  automata 

from  the  type  defined  in  lemma  1.1 

2  2 

The  set  of  states  of  any  two  different  partial  automata  A  £  A 

i.q 

(i  *  j  or  q  A  r)  are  disjoint.  Together  they  form  a  partial 
2  1 

automaton  with  n  )An)  states.  Define  the  initial  state  to  be  the 

2  —2 

initial  state  of  A.  1 .  Denote  this  partial  automaton  by  A  .  We  will 
1 » 1 

extend  the  automaton  inductively  so  that  the  partial  automaton  at  stage 


p,  Ap  will  satisfy 


1  2 

r(A  ,A  )  <  maxmin  (G) 
'  s  p'  — 
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Define 


2  -2 
A1  =  A 


1  2 

Let  p  >  1  and  play  A  .  and  A 

P+1  P 


Let  t  be  the  first  time  that 


1  11 
Ap+1  "reveals  his  identity"  i.e.  the  action  played  by  Ap+1  -  -  is 

,  1  1> 

such  that  the  sequence  (s,| , . . . ,  s^J  is  different  from  the  action 

,  1  1 

sequences  of  the  automata  A^,...,A  .  (If  such  a  t  doesn't  exist 

2  2 

define  A  .  =  A  . ) 

P+1  P 


Denote: 


the  state  of  A^  at  stage  t. 


*t+1 


1  1 
the  state  of  A  at  stage  t  +  1  (q^ 

determined  at  stage  t.  ) 


t+1 


is 


ai(p  +  1)  -  the  index  of  the  automaton  which  is  identical  to 

1  -1 

A  .  but  has  q  .  as  an  initial  state. 
p+1  Mt+1 


ci(p  +  1)  -  the  index  of  the  first  copy  of  the  automata 

2  2 

A  ,/  a\  A  .  ,  that  hasn't  been  used  yet. 

ai(p+1),1  ai(p+1),n 


(Formally  ci(p  +  1)  =  min  {j|  there  doesn't  exist  1  ^  r  ^  p  +  1 
such  that 


ai(r)  =  ai(p  +  1)  and  ci(r)  =  j}.) 
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2  2  1 

The  inductive  step  is  to  set  g  (q  ,  s  )  to  the  initial  state  of 

.2 

ai(p+1 ),ci(p+1 ) 

It  is  easy  to  see  that  this  definition  is  an  extension  and  (l)  is 
satisfied.  Finally  define 


2  2 
A  =  A 


Theorem  2;  Let  Q(n)  be  a  function  which  satisfies  Q(n)  2  n  and 

lim  Mikll  =  o  then  lim  V(G  .  J-V(G). 
n-*°  n  n-*»  n,Q(n) 

Proof :  For  each  n  we  will  define  a  mixed  strategy  Pn  for  player 

I  (i.e.  a  probability  distribution  on  An)»  such  that  for  every 

^  €  AQ(n)  ^  1  lpn^A^  *  *(AV)  2  V(G)  -  en  where  lim^  ^  =  0. 

A  G  A 

n 

1  2  In 

Assume  S  =  and  S  =  {l,...,h}.  Define  =  {s  }  . 

Ill 

Let  w  £Q  w  =  fw„....w  ).  Denote  by  A  (A  E  A  )  the  following 
n  '■I  n'  'w^wn' 

automaton: 

1  -1  1  fi+liCn 

M  -  {l,...,n},  q  -  1,  f  (i)  =  w  ,  g(i,s^)  = 

J  1  i  =  n 

Let  (P1f..»Pk)  be  a  mixed  strategy  for  player  I  in  the  game  G,  such 

k  2 

that  £  ^  P^  •  r(i,j)  _>  V(G)  for  every  j  6  S  .  Define  a  probability 


(on.2  n)  by 


x  (w)  =  n?  .  p  . 
n'  i=1  w , 


measure  on 
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The  strategy  Pn  of  player  I  in  the  game  Gn>Q(n)  is  defined  by 


^0  =  nn(w). 


We  have  to  show  that, 


(1)  For  every  e  >  0  there  exists  N(e)  such  that  for  every 

2  2 

n  >  N(e)  and  for  every  A  E  \  (1‘)  is  satisfied 


(1')  l  u  (w)r(a”\A2)  >  V(G)  -  e. 

w  eQ 

n 


It  suffices  to  show  that. 


(2)  for  every  e  >  0  there  exists  N(e)  such  that  for  every 
2  2 

n  >  N(e)  A  e  A,/  x  and  eg  N  (2' )  is  satisfied. 

Q(.n  j 


(2*) 


(c+1 )n 


w  e  Q 


X  i  l  Rt(A.^,A2)  >  V(G)  - 

—  r\  H  II  a.  .  a  «  * 


t=c*n+1 


2  2  2 

Let  A  G  s  and  assume  M  =  {l , . . . ,Q(n)  }.  Let 

QinJ 

2  2 
A  (p)  P  =  1,...,Q(n)  denote  the  automaton  which  is  identical  to  A 

but  with  initial  state  p.  Let  q(A  ,w,c)  denote  the  state  of  the 

automaton  A2  in  the  stage  c  •  n  +  1 ,  when  player  I  chooses  A^. 


Finally  defines 


(3) 


X  ?  (w) 

A  ,c 


def  1 

n 


(c+1 ) *n 

l 

t=c*n+1 


l  R  (A^A2(q(A2,w,c))) 
t=1  Z  " 


Note  that  the  left  expression  in  (2')  is  the  expectation  of  XA^>C.  To 
estimate  the  expectation  and  compare  it  to  V(G),  we  will  estimate 


(4) 


dn{w:  x  2  <  “  £}* 

A  ,c 


The  important  point  in  the  proof  is  that  the  state  of  A  in  stage 
SL,  1  <  SL  <  n  is  determined  by  the  first  l  -  1  actions  of  player  I, 
Thus,  with  every  action  j  =  1,...,h  of  player  II  we  can  associate  a 

sequence  of  random  variables  (f .  ) 

0 »  X-  » •  •  •  >n 


def 


(  1  the  action  of  A  at  stage  l 

when  it  plays  against  A  is  j 

w 


0  otherwise 


and  f.  .  are  measurable  w.r.t.  the  algebra  which  is  generated  by  the 
J  *  * 

first  l  -  1  elements. 

We  now  need  two  lemmas. 

2 

Given  an  automaton  A  ,  call  a  sequence  w  6  "nice",  if  the 
number  of  times  the  actions  (i,j)  are  played  divided  by  the  number  of 
times  j  is  played,  is  'close'  to  P^.  Lemma  2.1  says  that  'almost' 
all  the  sequences  are  "nice".  Lemma  2.2  states  that  on  a  "nice" 
sequence  the  average  payoff  is  'close'  to  V(G). 
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Lemma  2.1 :  Let  A  =  {alt...,ak}  be  a  finite  set.  Let  (A,2  ,Pj 

,  Q  .  ,  A  *n 

be  a  probability  space  and  let  (Q,2  ,tij  be  the  product  space  (A, 2  ,P  J 

i.e.  Q  =  A°  and  for  every  w  =  A  n(w)  =  Let 

Q 

be  the  partial  algebra  of  2  which  is  generated  by  w^  , . . .  ,w 

H  =  { <t> , £2} .  Let  {f  .  H”]  ’  *  *  * ,h  be  a  set  of  random  variables  which 

are  adapted  to  (h  )  ^  f.  »  fs  measurable  w.r.t. 

(Q.H^))  and  with  %'alues  in  the  set  {° » 1  } *  Define 

S  =  f  w:  |—  l  l(w  =  a  )(w)  •  f  ^w)  -  ~  l  P(a  )  •  t.  ,(w)|  >  e 

e  'n  A  i  n  ^  i  J*-*- 


for  some  1  <  j  <  h  1  ^  i  k 


There  exists  b  >  0  such  that  for  every  b  >  e  >  0 


p(Se)  <  2  •  k  •  h  •  e 


In  lemma  2.2  Q  and  f .  .  refer  to  {s  j  and  the  indicator 
n  0 » * 

functions  of  the  actions  respectively,  as  defined  before  Lemma  2.1.  let 


W(G)  denote  max  .99? 

s'  E S  ,  s^es" 


|r(s1,s2)|. 


Lemma  2.2:  Define 


M!  w  G  Qn’  I  n  A  =  ^(W)  *  f0,*(w)  "  n  1*1  *  fj,A(w)'>( 


for  some  1  £  j  <_  h  1  £  i  £  k 


If  w  Q  -  S  _  then  V(G)  -  -  7"  .R4.(a1,A2)  <  W(G)  •  k  •  h  •  e. 
n  ,2  v  '  n  H=1  t^  w’  J 

e,A 

The  proofs  of  the  lemmas  are  in  the  Appendix. 


We  can  now  evaluate  (4) 


{w:  X  (w)  <  V(G)  -  e}  C  U  {w:  X  (w)  <  V(G)  -  e} 

A2,c  “  p-1  A2(p),0 


Let  h  =  — r— ? - - - -  .  Note  that  by  lemma  2.2  if  »  6  2  -  S 

U<3)  •  k  •  h  "  „,»2(p> 

then  X  (w)  >  V(G)  -  e.  This  and  lemma  2.1  imply  that  for  every 

a2(p),o 


p  =  1 1 ... i Q(n) 


2 

-t)  •  n 


u  {ws  X  (w)  <  V(G)  -  e}  <  u  (S  „  )  <  2  •  k  •  h  •  e 

n  A2(p),0  n  T),  A2(p) 


From  this 


{w;  X  (w)  <  V(G)  -  e}  <  I  u  {w:  X  (w)  <  V(G)  -  e} 


n 1  2 

A  tc 


A  (p),0 


<  Q(n)  •  2  •  k  •  h 


4 


Since  lim^  (ao  ~  AnQ(n)  =  0,  for  every  h  >  0 


2 

-r)  1  n 


limn+<oQ(n)  •2»k*h,e 


=  0.  Hence  for  every  e  >  0  there 


exists  N(e)  such  that  for  any  n  >_  N(c),  £  ^Q(n)  and  c  e  N*  (5) 


is  satisfied. 


u  {w|X  (w)  <  V(G)  -  e}  <  e 

n  •  ^  _ 
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Since  X  are  uniformly  bounded,  i.e.  (Vc,Vn,VA2  €  aL  v  |x  (w)|  <. 

A  ,c  Q('n;  a  ,c 

W(G) )  we  get  (2). 

We  have  proved  lim  v(G  ,  v )  >  V(G).  Since  Q(n)  >  n  a 
-  n+  »  v  n,Q(n)  —  — 

similar  argument  shows  that  lim  v(g  ,  . )  <  V(G).  These 

n*®  n ,  Q  ( n )  - 

inequalities  imply  the  result. 

4.  Non  Zero-Sum  Games 

Theorem  2  gives  an  estimation  for  the  asymptotic  behavior  of  the  set 
of  equilibrium  payoffs  in  an  m-person  game.  Let  G  be  an  m-person 
game.  An  automaton  for  player  i  is  the  same  as  was  previously 
defined,  except  that  the  transition  function  is  defined  on  M*  x  S  * 
where  S  *  is  the  set  of  action  tuples  of  the  other  players.  Let 
S(z^,...,zm)  denote  the  set  of  equilibrium  payoffs  in  the  repeated  game 
when  player  i  is  restricted  to  strategies  that  can  be  implemented  by 
automata  of  size  Let  S.  denote  the  set  of  individual  rational 

payoffs  in  mixed  strategies.  Let  co  G  denote  the  convex  hull  of  the 
vectors  in  the  payoff  matrix  of  G.  Let  y  =  (y^,...,ym)  denote  the 
vector  of  maximal  payoffs  that  each  player  can  get  regardless  of  what 
the  other  players  do  (in  mixed  strategies).  Define 

S  =  {x:  x  e  oo  G  x  >  y}. 

Theorem  3:  Let  Q2(n) , . . .  .^(n)  be  functions  such  that  Q^n)^  n 
*n[Q,(n)] 

and  lim  _ _ * -  =  0.  Then 

S  C  lim  S(n9Q  (n) 9 • • • ,Q  (n) )  C  lim  ^^(S(n9 Q  (n ) 9 • • • p Q  (n) )  C 
—  —  —  n*^  2  m  —  n-H*  2  m  — 


t 


It  is  easy  to  see  that  every  rational  convex  combination  of  the  payoff 
vectors  in  G  can  be  implemented  by  automata  which  are  large  enough. 
Player  1  can  get  yA  by  using  the  strategy  that  was  defined  in 
Theorem  2.  The  other  players  can  bring  player  i  down  to  his 
individual  rational  payoff  by  using  the  same  type  of  strategies.  The 
proofs  of  the  last  two  claims  are  similar  to  the  proof  of  Theorem  2.  I 
omit  the  details. 

Corollary:  Let  G  be  a  2-person  game  and  let  Q(n)  be  a  function 

that  satifies  Q(n)  >  n  and  lim  J  _  q#  Then 

—  n+“  n 

lim  S(n,Q(n))  =  S  =  5. 
n-x» 


5.  Conclusion 

So  far  we  have  considered  a  specific  measure  of  complexity.  However 
a  version  of  Theorem  2  is  true  for  a  large  class  of  measures. 

Definition:  A  function  g:  N  ♦  N  is  log  -  polynomial  bounded 
(henceforth  l.p.b.)  if  there  exists  a  polynomial  p  such  that 
An(g(n))  <  p(£n(n)). 

Note  that  every  polynomial  is  a  l.p.b. 

Consider  a  complexity  measure  as  a  function  from  the  set  of 
strategies  to  the  natural  numbers. 

Definition:  Two  measures  of  complexity  are  log  -  polynomial 

equivalent  if  there  exist  l.p.b.  functions  g^(n)  g2(n)  _>  n  such  that 
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C^fxs  x  <  n)  CC^  {x:  x<gl(n)} 

—  1  —1 
C2  {xj  x  <  n}  C  C1  (x:  x  <  g2(n) } 

C 

Let  C  be  a  complexity  measure.  G  denotes  a  game  that  is  similar 

n ,  m 

to  Gn  m  except  that  the  complexity  of  the  strategies  is  measured  by 
C.  (So  player  I,  for  example,  is  restricted  to  strategies  that 
according  to  C  have  a  complexity  that  is  less  than  n. ) 


Theorem  4x  Let  C  be  a  complexity  measure  that  is  log-polynomial 
equivalent  to  the  automata  measure  and  let  Q(n)  be  a  l.p.b.  function 
that  satisfies  Q(n)  >  n.  Then 


lim  V(G 

n-*» 


C 

n,Q(n) 


)  =  V(G). 


Proof :  There  exists  a  l.p.b.  function  g^  such  that  player  II  is 
restricted  to  strategies  that  can  be  implemented  by  an  automaton  of 
size  gl(Q(n)).  Since  and  Q  are  l.p.b.  there  exists 

polynomials  p^  p2 

such  that  An[g1 (Q(n) ) J  £  p^ [in(Q(n) ) ]  and  *n[Q(n)  ]  <  p^finCn)  ]. 
Putting  this  together  gives: 


(1)  An[g1 (Q(n)) ]  <  P1 [p2(ln(n))  ]. 

There  exists  a  l.p.b.  function  g2  such  that  for  every  x  £  N  that 
satisfies  g2(x)  _<  n  player  I  can  use  any  strategy  that  can  be  imple¬ 
mented  by  an  automaton  of  size  x.  Let  m  be  the  largest  number  such 
that  m  <  n  and  g2(m)  n.  We  have  +  1  >  n.  Since  g2  is 
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l.p.b.  there  exists  a  polynomial  p^  such  that  ln[g^(®)  +  1  ]  _<  [ An (n» ) 

Putting  together  with  (1)  we  get: 

■*«[g1  (Q(n) )  ]  <  p1  (p2(p3(ln(m)) )). 

A  composition  of  polynomials  is  a  polynomial  hence 

Pl[p2[p, £*»<■)]]] 


Theorem  2  implies  lim  v(G  ,  n)  >  V(G). 

-  n+*°  v  n,Q(n)  — 

_  c 

Since  Q(n)  ^  n  a  similar  calculation  gives  lim  n>0Dv(Gn  Q(n)  )  ~  V(G) 
The  last  two  inequalities  imply  the  result. 


APPENDIX 


Proof  of  Lemma  2.1:  We  will  show  if  f  is  measurable  w.r.t.  then, 


ifa^A  p{w:  |  ~  l  l(w-  =  a)(w)  •  f.(w)  -  ~  l  P(a)  •  f  f(w)|  >  e}  <  2 


-e2n 


JL=  1 


Jb=1 


It  is  easy  to  see  that  this  implies  the  lemma. 


Proof:  Define 


Zi  =  I(wi  =  a)  "  P(a) 


Y„  =  Z.  •  f. 

Ill 


It  suffices  to  show  that  there  exists  A  >  0  such  that 


(D 


2 

-e  n 


h{w:  |A  •  1  Y-|  >  A  •  e  •  n}  <  2  •  e  4 


JU=1 


We  will  show  that  there  exists  A  >  0  which  satisfies: 


(2) 

(3) 


p{w:  A  •  l  L  >  X  •  e  •  n}  <  e 
Jbl 


n{w:  A 


n 

Y,  <  -A  • 
JM 


e  •  n} 


This  implies  (1).  We  will  show  (2).  ( 3 )  can  be  proved  in  a  similar  way. 

Z^  are  independent  w.r.t.  H^,  hence  E(exp(A  •  Z^)  |  H^)  = 

E(exp(A  •  Z^)).  Exp  is  a  convex  function  therefore: 
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(4)  1  =  exp(E(\  •  Z^))  <  E(exp(X  •  Z^)) 

By  the  Taylor  expansion 

%2  _2 
X  •  Z. 

exp(\  •  Z^)  =  1  +  X  •  Z^  +  - - -  +  R(\  •  Z^) 

where 

R(X-  *  Z  ) 

lin  ♦  0  — - - f  =  0 

X  X2  •  Z, 

Hence  there  exists  d  >  0  such  that  if  d  >  X  _>  0 

2  2 

exp(X  •  Zj)  <.1+X»Z^+X  •  Z^ 

which  implies 

(5)  E(exp(x  •  Z^))  <  1  +  X  •  E(Z  )  +  \2  •  E(Z^)  <  1  +  X2 

Z1f...|Zn  are  i.i.d.  from  (4)  and  (5)  we  have 

1  <  E(exp(X  •  l  Z  ))  <  (l  +  X2f 
Jl=1  * 

Claim;  VA  1  U<n 

(6)  E(exp(X  •  Yx)  |  H^)  <  E(exp(X  •  Z^)  |  H^)  =  E(exp(X  •  Z^) 

Proof :  When  =  1  Y^  =  Z^.  When  =  0  Y^  =  0  and  thus  the 
left  expression  equals  one  while  the  right  expresson  is  greater  than  or 
equal  to  one. 

Claim:  1  £  j  i  n 

j  0 

E(exp(X  •  l  Y  ))<E(exp(X»  l  Z  )) 

A=1  Jtl 


(7) 
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Proof:  By  induction.  For  3  =  1  the  claim  follows  from  (6). 

Let  j  >  1 

3  3-1 

E(exp(X  •  l  Y  ))  =  E(E(exp(X  •  [  !.)  •  exp (X  •  Y  )|H  )) 

1=1  *  1=1  33 

1— "1 

since  Y^  Is  measurable  w.r.t.  H.. 

3"1 

=  E(exp(X  •  l  Y  )  •  E(exp(\  •  Y  )|H  )) 

1=1  3  3 

<  E(exp(X  •  Z.))  •  E(exp(X  •  l  Y  ))  by  (6) 

J  1=1  X 

<  E(exp(X  •  Z.))  •  E(exp(X  •  £  Z  ))  by  the  induction  hypothesis 

"  j  1=1  * 

3 

=  E(exp(x  •  J  Z^))  since  Z^*...*Z,  are  i.i.d. 

1=1 

From  (7)  and  (5)  we  get 

E(exp(X  •  I  Y  ))  <  (1  +  X2)n 
1=1  x 

By  Chebychev  inequality 

n  2  n 

p{w:  X  •  J  Yp  >  X  •  £  •  n}  •  exp(X  •  e  •  n)  <.  (l  +  X  ) 

1=1  * 

n  2 

p{w:  X  •  l  . Y  >  X  •  e  •  nl  <  (l  +  X  )  •  exp(-X  •  e  •  n) 

1=1 
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2 

-  e  n 


For  \  =  —  the  right  expression  is  less  than  e 


(since 


2 

e  n 


(l  +  \2)n  <  e  *  ),  hence  for  0  <  e  <  2d  (d  is  the  constant  derived 

from  the  Taylor  expansion  of  Exp)  (2)  is  satisfied. 


Proof  of  Lemma  2.2:  Denote: 


A  (w)  -  the  number  of  times  (i,j)  was  played 
**■  J 


2  nJC 

x  (w)  -  the  number  of  times  A  played  j.  x.(w)  =  L_1  A.  .(w), 

j  J  i  ~ '  i  3 

From  the  definitions 


A  .(w)  =  l  l(w  =  i)(w)  •  f.0(w) 


ij 


A=1 


jA 


•  p.  .  i  Pi  •  t  (.) 


JL=1 


Hence  if  w  e  Q  -  S  _  then  for  every  1<i<k  1  <  j  <  h 

n  e,A2  ~  ~  “  ~ 


1 

Vw) 

-  P. |  <  ' 

1 

x.(w) 

J 

From  this 

x.(w) 

i  .1 

k 

*  (  l 

i=1 

A  .(w) 
i-i 

'  n 

Xj(w) 

< 

x.(w) 

n 

k 

*  l 

1=1 

Aii(w) 

1  -  -  ] 

For 

every  1 

<  J  < 

k 

h  l  1 

i=1 

Hence  summing 

over  j 

gives 

the  result 

FOOTNOTES 


I  prefer  not  to  use  the  term  "bounded  rationality"  here,  since  it 
has  been  used  as  a  name  for  a  large  class  of  limitations.  For 
example  partial  information  of  possible  actions  or  of  consequences 
of  a  given  action  has  also  been  referred  to  as  "bounded  rational¬ 
ity"  even  when  there  is  no  computation  that  will  give  additional 
information. 

Since  the  set  of  the  states  of  each  automaton  is  finite  the 
automata  enter  a  cycle,  i.e.  there  exists  numbers  c, 

12  12  12 
k  _<  |M  |  |M  |  such  that  for  every  t  >_  c  (qt.Qt)  =  (qt+k»qt+k) 

and  so  the  limit  (l)  exists. 
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